Towards Safe Control of Continuum Manipulator Using Shielded Multiagent Reinforcement Learning
نویسندگان
چکیده
Continuum robotic manipulators are increasingly adopted in minimal invasive surgery. However, their nonlinear behavior is challenging to model accurately, especially when subject external interaction, potentially leading poor control performance. In this letter, we investigate the feasibility of adopting a model-free multiagent reinforcement learning (RL), namely deep Q network (MADQN), 2-degree freedom (DoF) cable-driven continuum surgical manipulator. The robot formulated as one DoF, agent problem MADQN framework improve efficiency. Combined with shielding scheme that enables dynamic variation action set boundary, leads efficient and importantly safer robot. Shielded enabled perform point trajectory tracking submillimeter root mean square errors under loads, soft obstacles, rigid collision, which common interaction scenarios encountered by manipulators. controller was further proven be effective miniature high structural nonlinearitiy, achieving accuracy payload.
منابع مشابه
Reinforcement Learning for Multi - Linked Manipulator Control
We present an automatic trajectory planning and obstacle avoidance method for a multi-linked manipulator which uses position and velocity sensor information directly to produce the appropriate real-valued torques for each joint. The inputs are fed into a Cerebellar Model Arithmetic Computer (CMAC) [1] and in each state, the expected reward and torques for each joint are learnt through self-expe...
متن کاملMultiagent Reinforcement Learning for Urban Traffic Control Using Coordination Graphs
Since traffic jams are ubiquitous in the modern world, optimizing the behavior of traffic lights for efficient traffic flow is a critically important goal. Though most current traffic lights use simple heuristic protocols, more efficient controllers can be discovered automatically via multiagent reinforcement learning, where each agent controls a single traffic light. However, in previous work ...
متن کاملLyapunov Design for Safe Reinforcement Learning Control
We propose a general approach to safe reinforcement learning control based on Lyapunov design methods. In our approach, a Lyapunov function—a special form of domain knowledge—is used to formulate the action choices available to a reinforcement learning agent. A learning agent choosing among these actions provably enjoys performance guarantees, and satisfies safety constraints of various kinds. ...
متن کاملAsymmetric Multiagent Reinforcement Learning
A novel model for asymmetric multiagent reinforcement learning is introduced in this paper. The model addresses the problem where the information states of the agents involved in the learning task are not equal; some agents (leaders) have information how their opponents (followers) will select their actions and based on this information leaders encourage followers to select actions that lead to...
متن کاملA Survey on Multiagent Reinforcement Learning Towards Multi-Robot Systems
Multiagent reinforcement learning for multirobot systems is a challenging issue in both robotics and artificial intelligence. With the ever increasing interests in theoretical research and practical applications, currently there have been a lot of efforts towards providing some solutions to this challenge. However, there are still many difficulties in scaling up multiagent reinforcement learnin...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE robotics and automation letters
سال: 2021
ISSN: ['2377-3766']
DOI: https://doi.org/10.1109/lra.2021.3097660